Efficient Sparse Cholesky Factorization on a Massively Parallel SIMD Computer
Authors
Abstract
We investigate the effect of load balancing when performing Cholesky factorization on a massively parallel SIMD computer. In particular, we describe a supernodal algorithm for performing sparse Cholesky factorization. The way the matrix is mapped onto the processors has a significant effect on the algorithm's efficiency. We show that this assignment problem can be modeled as a graph coloring problem in a weighted graph. Using a simple greedy algorithm, we obtain substantial speedup compared with previously suggested data mapping schemes. Experimental runs have been made on a 16K processor MasPar MP-2 parallel computer using symmetric test matrices with irregular sparsity structure. On these problems our implementation achieves performance rates of well above 200 Mflops in double precision arithmetic.
Key words: sparse matrix algorithms, Cholesky factorization, systems of linear equations, parallel computing, data parallel algorithms, MasPar, graph coloring
AMS subject classifications: 05C50, 05C85, 15A23, 65F05, 65F50
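The idea of treating a data mapping as a coloring of a weighted graph can be illustrated with a small sketch. The abstract does not spell out the paper's graph model or its greedy rule, so everything below is an assumption made for illustration: vertices stand for work items, colors stand for processors, an edge weight is the cost paid when both endpoints receive the same color, and the hypothetical helpers `greedy_weighted_coloring` and `conflict_weight` are not taken from the paper.

```python
from collections import defaultdict


def greedy_weighted_coloring(num_vertices, weighted_edges, num_colors):
    """Give each vertex one of num_colors colors, greedily keeping the
    total weight of same-colored edges small (illustrative heuristic,
    not the paper's algorithm)."""
    adj = defaultdict(list)
    for u, v, w in weighted_edges:
        adj[u].append((v, w))
        adj[v].append((u, w))

    # Visit vertices with the heaviest incident weight first; ties by index.
    order = sorted(range(num_vertices),
                   key=lambda u: (-sum(w for _, w in adj[u]), u))

    color = [None] * num_vertices
    for u in order:
        # penalty[c] = weight of edges to already-colored neighbors with color c
        penalty = [0.0] * num_colors
        for v, w in adj[u]:
            if color[v] is not None:
                penalty[color[v]] += w
        # Pick the cheapest color; ties go to the lowest color index.
        color[u] = min(range(num_colors), key=lambda c: (penalty[c], c))
    return color


def conflict_weight(weighted_edges, color):
    """Total weight of edges whose two endpoints share a color."""
    return sum(w for u, v, w in weighted_edges if color[u] == color[v])


# A weighted triangle colored with two colors: one edge must be monochromatic,
# and the heuristic leaves the lightest one in conflict.
edges = [(0, 1, 5.0), (1, 2, 3.0), (0, 2, 1.0)]
colors = greedy_weighted_coloring(3, edges, 2)
print(colors, conflict_weight(edges, colors))  # [1, 0, 1] 1.0
```

A greedy pass like this makes one local decision per vertex and never revisits it, which keeps the cost low but gives no optimality guarantee in general.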
Similar resources
Efficient Sparse Cholesky Factorization on a Parallel SIMD Computer
We investigate the effect of load balancing when performing Cholesky factorization on a SIMD computer. In particular we describe a supernodal algorithm for performing sparse Cholesky factorization. The way the matrix is mapped onto the processors has significant effect on its efficiency. We show that this assignment problem can be modeled as a graph coloring problem in a weighted graph. By a simple...
Highly Parallel Sparse Cholesky Factorization
We develop and compare several fine-grained parallel algorithms to compute the Cholesky factorisation of a sparse matrix. Our experimental implementations are on the Connection Machine, a distributed-memory SIMD machine whose programming model conceptually supplies one processor per data element. In contrast to special-purpose algorithms in which the matrix structure conforms to the connection s...
Scalable Parallel Algorithms for Solving Sparse Systems of Linear Equations
We have developed a highly parallel sparse Cholesky factorization algorithm that substantially improves the state of the art in parallel direct solution of sparse linear systems—both in terms of scalability and overall performance. It is a well known fact that dense matrix factorization scales well and can be implemented efficiently on parallel computers. However, it had been a challenge to dev...
A PERFORMANCE STUDY OF SPARSE CHOLESKY FACTORIZATION ON INTEL iPSC/860
The problem of Cholesky factorization of a sparse matrix has been very well investigated on sequential machines. A number of efficient codes exist for factorizing large unstructured sparse matrices, for example, codes from Harwell Subroutine Library [4] and Sparspak [7]. However, there is a lack of such efficient codes on parallel machines in general, and distributed memory machines in particul...
Parallel Sparse Cholesky Factorization with Spectral Nested Dissection Ordering
We show that the use of the spectral nested dissection (SND) ordering leads to an efficient parallel sparse Cholesky factorization on an Intel Paragon multiprocessor. Department of Computer Science, Old Dominion University, Norfolk, VA 23529-0162 and ICASE, NASA Langley Research Center, Hampton, VA 23681-0001 (pothen@cs.odu.edu, pothen@icase.edu). The research of the first and fourth author...
Journal title:
- SIAM J. Scientific Computing
Volume 16
Publication date 1995